On Rounds in Quantum Communication 



Hartmut Klauck 
CWI 

(N ! P.O. Box 94079 

£^ ; 1090 GB Amsterdam, the Netherlands 

klauck@cwi .nl 

00 

^ Abstract 

| We investigate the power of interaction in two player quantum com- 

munication protocols. Our main result is a rounds-communication hi- 
erarchy for the pointer jumping function We show that fk needs 
quantum communication fi(n) if Bob starts the communication and 
the number of rounds is limited to k (for any constant k). Trivially, if 
■ Alice starts, O(fclogn) communication in k rounds suffices. The lower 

bound employs a result relating the relative von Neumann entropy be- 
tween density matrices to their trace distance and uses a new measure 
of information. 

We also describe a classical probabilistic k round protocol with 
communication 0(n/k- (log' fe ^ 2 ^ n + logfc) + A:logn) in which Bob starts 
the communication. 
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^ i Furthermore as a consequence of the lower bound for pointer jump- 

ed ing we show that any k round quantum protocol for the disjointness 

problem needs communication f2(n 1 / fe ) for k = 0(1). 

X 

b ! 1 Introduction 

cz i 

Quantum mechanical computing and communication has been studied ex- 
tensively during the last decade. Communication has to be a physical pro- 
cess, so an investigation of the properties of physically allowed communica- 
tion is desirable, and the fundamental theory of physics available to us is 
quantum mechanics. 

The theory of communication complexity deals with the question how effi- 
cient communication problems can be solved and has various applications to 
lower bound proofs (introduction to (classical) communication complexity 
can be found in |l9|). The communication complexity approach to lower 
bounds consists of reducing a lower bound proof for some computational 
model to a communication complexity lower bound, where several tech- 
niques for such proofs are available, see Jl^j for many examples. 
In a quantum protocol (as defined in |}(]]) two players Alice and Bob each 
receive an input, and have to compute some function defined on the pair 
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of inputs cooperatively. To this end they exchange messages consisting of 
qubits, until the result can be produced by some measurement done by one 
of the players (for overviews on quantum communication complexity see 1 2£ ] 
and |||). 

A slightly different scenario proposed in [||] and || allows the players to start 
the protocol possessing some (input independent) qubits that are entangled 
with those of the other player. Due to the "superdense coding" technique 
of Q in this model 2 classical bits can be communicated by transmitting 
one qubit (and using up one EPR pair). See [|30|, |l5| for some examples of 
lower bounds via communication complexity in the quantum setting. Un- 
fortunately so far only few "applicable" lower bound methods for quantum 
protocols are known: the rank lower bound is known to hold for exact (i.e., 
errorless) quantum communication j|, ||], the (usually weak) discrepancy 
lower bound for bounded error protocols [18]. 

One breakthrough result of the field of quantum computing is Grover's 
search algorithm that retrieves an item from an unordered list within 0(y/n) 
questions |11], outperforming every classical algorithm for the problem. By 
an application of this search algorithm to communication complexity in Q 
an upper bound of 0(y/n log n) is shown for the bounded error quantum 
communication complexity of the disjointness problem DISJ n (both play- 
ers receive an incidence vector of a subset of {1, . . . , n} and have to decide 
whether the sets are not disjoint: \/(xi A yi)), one of the most important 
communication problems. This yields the largest gap between quantum and 
classical communication complexity known so far for a total function, the 
probabilistic communication complexity of disjointness is O(n) |]l3[ . Cur- 
rently no superlogarithmic lower bound on the bounded error quantum com- 
munication complexity of the disjointness problem is known. 
Unfortunately the protocol for disjointness using Grover search takes Q(^/n) 
rounds, an unbounded increase in interaction compared to the trivial pro- 
tocol communicating n bits in 1 round. Similar phenomena show up in 
the polynomial gaps between quantum Las Vegas and probabilistic bounded 
error communication complexity for total functions, see || 
We are interested in the question how efficient total communication prob- 
lems can be solved in the quantum model when the number of rounds is 
restricted. The most severe restriction is one-way communication, where 
only a monologue is transmitted from one player to the other, who then 
decides the function value. This has been investigated e.g. in J15| |, where a 
lower bound method based on the so called VC-dimension is proved, which 
allows to prove an exponential advantage for 2 round classical compared 
to 1 round quantum communication complexity. Kremer |18| exhibits a 



Exponential gaps between quantum communication complexity and classical proba- 
bilistic communication complexity are known only for partial functions, and are possible 
without interaction IBS, ffl. 
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complete problem for the class of problems with polylogarithmic quantum 
one-way communication complexity (in the case of bounded error). 
In a series of papers (see [10], [|l^], [24], [25], (l4|]) more general round hier- 
archies of the following form are given for classical protocols: A function fk 
(usually the so called pointer jumping function) has k round communication 
complexity k log n if Alice starts the communication, but has a much larger 
k round communication complexity when Bob starts. 

Our main result is that /c-round quantum protocols need communication 
n/2 2 ° (fc) — k log n to compute the pointer jumping function when Bob starts. 
So changing the starting player (or reducing the number of rounds by 1) may 
result in drastically increased communication also in the quantum case. 
Nayak et. al. [22] have proved a lower bound of £l(n}' k ) for the quantum 
communication complexity of a certain subproblem of pointer jumping, in 
the situation when B starts and k rounds are allowed. 

We begin our consideration of the complexity of pointer jumping (in section 
5) with the description of a classical randomized protocol for pointer jumping 
using communication 0(n/k ■ (log^^ 2 ^ n + log A;) + fclogra) in the situation 
where Bob starts and only k rounds are allowed. This upper bound is close 
to the known lower bounds for classical protocols p3, 15]. 



The general strategy of our new lower bound for pointer jumping is to bound 
the value of a certain measure of information between the qubits of one player 
and the "next" pointer in terms of the analogous quantity for the previous 
pointer plus the average information on pointers in possession of the other 
player. 

The mentioned protocol makes clear why the usual measure of information 
does not work in this approach. So (after defining the main notions of quan- 
tum computing in section 2 and the model of communication complexity in 
section 3) we introduce a new measure of quantum information in section 
4. This measure is tied to the usual, von Neumann measure of quantum in- 
formation by a theorem, which connects the trace distance between density 
matrices to their relative von Neumann entropy. 

The lower bound on pointer jumping implies via reductions lower bounds 
for the k round bounded error quantum communication complexity of the 
disjointness problem of the order f2(n 1//fc ) for all constant k, see section 6. 
We conclude from our result that quantum communication is dependent on 
interaction, as one should expect for a "realistic" mode of communication. 
We also conclude that good speedups by quantum protocols imply the use 
of nontrivial interaction in the case of total functions: for an asymptotic 
speedup by quantum Las Vegas protocols always more than one round is 
necessary |jlS[ |. By the results in this paper (and similar results in |22]| ) 
rounds are also crucial in quantum speedups for the disjointness problem. 
The lower bound for a subproblem of pointer jumping given in |z| and the 
lower bound in this paper use at the basis of the proofs similar techniques. 
The main ingredient of the proof in |23], the "average encoding theorem", 
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follows directly from our theorem 1, which states a connection between a 
new measure of information (based on the trace distance) and the relative 
von Neumann entropy. We make use of a fact from [22], namely the "local 
transition theorem". A combined version of both papers appears in jT?]]. 
Our results also hold in the model, in which prior entanglement is available. 
The paper is organized as follows: In the next section we give some back- 
ground on quantum mechanics. Then we define the communication model in 
section 3. In section 4 we consider measures of information and entropy. In 
section 5 we prove our results on the complexity of pointer jumping. Section 
6 contains the lower bound for the disjointness problem. 



2 Quantum States and Transformations 

Quantum mechanics is a theory of reality in terms of states and transfor- 



mations of states. See [23, [26[ for general information on this topic with an 
orientation on quantum computing. 

In quantum mechanics pure states are unit norm vectors in a Hilbert space, 
usually C fc . We use the Dirac notation for pure states. So a pure state 
is denoted \<j>) or J2 x e{o,...,k-i} a x\ x ) wrt h Z)xe{0,...,fc-i} \ a x\ 2 = 1 an d with 
{ \x) \x G {0, . . . , k — 1}} being an orthonormal basis of C fe . 
Inner products in the Hilbert space are denoted (4>\ip), outer (matrix valued) 
products \(j))(ip\. 

If k = 2 l then the basis is also denoted { \x) \x 6 {0, 1}'}. In this case the 
space (D 2 is the I -wise tensor product of the space C 2 . The latter space is 
called a qubit, the former space consists of I qubits. 
Usually also mixed states are considered. 

Definition 1 Let {(pi, \4>i))\i = 1, • • • , &} with J2iPi = 1 an d Pi £ [0) 1] ^ e 

an ensemble of pure states of a quantum system, also called a mixed state. 
Pi = \<Pi){<Pi\ is the density matrix of the pure state \<pi). J2iPiPi is the 
density matrix of the mixed state. 

For a bipartite system with density matrix pab denote pa = traceBPAB- 
As usual measurements of certain observables and unitary transformations 



are considered as basic operations on states, see [23, 26 1 for definitions. 
For all possible measurements on a mixed state the results are determined 
by its density matrix. In quantum mechanics the density matrix plays an 
analogous role to the density function of a random variable in probability 
theory. Note that a density matrix is Hermitian, positive semidefinite and 
has trace 1. Thus it has only real, nonnegative eigenvalues that sum to 1. 
Linear transformations on density matrices are called superoperators. 

Definition 2 A superoperator is a linear map on density matrices. A su- 
peroperator is positive, if it sends positive semidefinite matrices to positive 
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semidefinite matrices. A superoperator is called completely positive, if its 
tensor product with the identity superoperator is positive on density matri- 
ces over each finite dimensional extension of the underlying Hilbert space. 

Trace-preserving completely positive superoperators map density matrices 
to density matrices and capture all physically allowed transformations. These 
include unitary transformations, tracing out subsystems, forming a tensor 
product with some constant qubits, and general measurements. 
The following important fact characterizes the allowed superoperators in 
terms of unitary transformations and tracing out (see p6|| ). This fact is 
known as the Kraus representation theorem. 

Fact 1 The following statements are equivalent: 

1. A superoperator T sending density matrices over Hi to density matri- 
ces over H2 is trace preserving and completely positive. 

2. There is a Hilbert space H3 with dim{H^) < dim(Hi) and a unitary 
transformation U , such that for all density matrices p over H\ the 
following holds: 

Tp = trace Hl ®Hi[ u (P® \ q h 3 ®h 2 )(0h 3 ®h 2 \)U' ]( ]. 

So allowed superoperators can be simulated by adding some blank qubits, 
applying a unitary transformation and tracing out, i.e., "dropping some 
qubits" . 

Definition 3 A purification of a mixed state with density matrix p over 
some Hilbert space H is any pure state \<p) over some space H ®K such that 
traceK\4>) = P- 

3 The Communication Model 

In this section we provide definitions of the computational models consid- 
ered in the paper. We begin with the model of classical communication 
complexity. 

Definition 4 Let f : {0, l} n x {0, l} n — > {0, 1} be a function. In a commu- 
nication protocol player Alice and Bob receive x and y and compute f(x, y). 
The players exchange binary encoded messages. The communication com- 
plexity of a protocol is the worst case number of bits exchanged for any input. 
The deterministic communication complexity D{f) of f is the complexity of 
an optimal protocol for f . 

In a randomized protocol both players have access to public random bits. The 
output is required to be correct with probability 1 — e for some constant e. 
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The randomized communication complexity of a function R t (f) is then de- 
fined analogously to the deterministic communication complexity. We define 
R(f) = Ri /3 (f). 

A protocol has k rounds, if the players exchange k messages with Alice and 
Bob alternating as speakers. In message k + 1 one player announces the 
result. Alice protocol is called one-way if only one players sends a message, 
and then the other player announces the result. The complexity notations 
are superscripted with the number of allowed rounds and eventually with the 
player starting, like D k,B or D k (usually Alice starts). 

Now we define quantum communication protocols. For general information 



on quantum computation see [23] and [26] 



Definition 5 In a quantum protocol both players have a private set of qubits. 
Some of the qubits are initialized to the input before the start of the protocol, 
the other qubits are in state |0). In a communication round one of the play- 
ers performs some unitary transformation on the qubits in his possession 
and then sends some of his qubits to the other player (the latter step does 
not change the global state but rather the possession of individual qubits). 
The choice of qubits to be sent and of unitary operations is fixed in advance 
by the protocol. 

At the end of the protocol the state of some qubits belonging to one player 
is measured and the result is taken as the output. The communication com- 
plexity of the protocol is the number of qubits exchanged. 
In a (bounded error) quantum protocol the correct answer must be given with 
probability 1 — e for some 1/2 > e > 0. The (bounded error) quantum com- 
plexity of a function, called Q t (f), is the complexity of an optimal protocol 
forf. Q(/) = Qi/ 3 (/). 

In fact we will consider a more general model of communication complex- 
ity, in which the players can apply all physically allowed superoperators 
to their private qubits. But due to the Kraus representation theorem (see 
fact 1) this model can be simulated by the above model without increasing 
communication (with the help of additional private qubits). 
We have to note that in the defined model no intermediate measurements 
are allowed to control the choice of qubits to be sent or the time of the 
final measurement. Thus for all inputs the same amount of communica- 
tion and rounds is used. As a generalization one could allow intermediate 
measurements, whose results could be used to choose the qubits to be sent 
and possibly when to stop the communication protocol. One would have to 
make sure that the receiving player knows when a message ends. 
A protocol with k rounds in this more general model can be simulated while 
loosing a factor of at most k in the communication: for each measurement 
the operations given by the Kraus representation theorem are used. The 
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measurement's result is then stored in some ancilla qubits. Now the global 
state is a superposition over the results and a superposition of the appro- 
priate communications can be used as a communication. This superposition 
uses as many qubits as the worst case message of that round. This may 
be at most the complexity of the whole protocol, so the overall complexity 
increases by at most a factor of k. While this simulation may not be sat- 
isfactory in general, it suffices to keep our lower bound valid in the more 
general model. 

In [^] and || a different model of quantum communication (the communi- 
cation model with entanglement) is proposed. Alice and Bob may possess 
an arbitrary input-independent set of (entangled) qubits in the beginning. 
Then they communicate according to an ordinary quantum protocol. This 
model can be simulated by allowing first an arbitrary input-independent 
communication with no cost followed by a usual quantum communication 
protocol in which the cost is measured. The superdense coding technique 
of ||] allows to transmit n bits of classical information with |~n/2] qubits in 
this model. 



Definition 6 The quantum bounded error communication complexity with 



entanglement and error e is denoted Q^ ub (f). Let Q pub (f) = Q?/a(/)- 



For surveys on quantum communication complexity see [29] and [16] 



4 Quantum Information Theory 

Our main result in the next section uses information theory arguments. First 
we define the classical notions of entropy and information. 

Definition 7 Let X : f2 — > S be a random variable on finite sets $1,S (as 
usual the argument of X is dropped). The density function (or distribution) 
of X is px ■ S — > [0,1], where px{x) is the probability of the event X = x. 
The entropy of X is H(X) = - J2xts Px(x) log p x (x). 
Let X,Y be random variables over O. The joint density function of XY 
is pxy( x ,u)- The information between X and Y is H(X : Y) = H(X) + 
H(Y) - H(XY). 

We use the convention OlogO = 0. 

Now we define the quantum mechanical notions of entropy and information. 



Definition 8 The von Neumann entropy of a density matrix px is de- 
fined by S(X) = S(px) = — trace(px logpx)- The relative von Neumann 
entropy between two density matrices p,a of the same size is S(p\\a) = 
trace(p(logp — log a)). This value may be infinite. 
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The von Neumann information is S(X : Y) = S(X) + S(Y) — S(XY) (see 
also Here S{X) is the von Neumann entropy of the reduced density 

matrix px = traceyPxY ■ 

The conditional von Neumann information is S(X : Y\Z) = S{XZ) + 
S{YZ) - S{Z) - S(XYZ). 

Note that the von Neumann entropy depends only on the eigenvalues of a 
matrix and is thus invariant under unitary transformations. If the under- 
lying Hilbert space has dimension d then the von Neumann entropy of the 
density matrix is bounded by log d. 

Not all relations in classical information theory hold for von Neumann en- 
tropy. The following fact contains the so-called Araki-Lieb inequality (*) 
and its consequences, which describes a notable difference to classical en- 
tropy (see 



Fact 2 For all bipartite states pxY'- 
S(X) + S{Y) > S{XY) > \S(X) - S(Y)\, 
S(X : Y) < 2S(X). 

Then also S(X :Y\Z)< S(X : Y Z) < 2S(X) holds. 

The following is an important property of the von Neumann entropy, see 



[26|. This property is known as the Lindblad-Uhlmann monotonicity of the 



von Neumann entropy. 

Fact 3 For all trace-preserving, completely positive super operators F and 
all density matrixes p,a: 

S(p\\a)>S(Fp\\Fa). 

We are going to introduce another measure of information based on the 
distinguishability between a bipartite state and the state described by the 
tensor product of its two reduced density matrices. Now we consider mea- 
sures of distinguishability. One such measure is the relative entropy. For 
probability distributions the total variational distance is another useful mea- 
sure. 

Definition 9Ifp,q are probability distributions on {1, . . . ,n} , then their 
distance is defined 



\\p-q\\ = ^2\p( 



„ ')-(/[ 0\- 

i=l 



The following norm on linear operators is considered in [0. 

Definition 10 Let p be the matrix of a linear operator. Then the trace 
norm of p, denoted \ \p\\i, is the sum of the absolute values of the elements 
of the multiset of all eigenvalues of p. In particular \ \p\\i = Tr(\/ p^ ■ p). 
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Note that the distance \\p — a\\\ is a real value for Hermitian matrices p, a. 
The trace norm has a close relation to the measurable distance between 
states as shown in 

Fact 4 For an observable O and a density matrix p denote p® the distribu- 
tion on the outcomes of a measurement as induced by O on the state p. 

\\p-a\\ 1 = max{||p° -p°\\}. 

So two density matrices that are close in the trace distance cannot be dis- 
tinguished well by any measurement. 

The next lemma is related to fact 4 and follows from fact 1. 

Lemma 1 For each Hermitian matrix p and each trace-preserving com- 
pletely positive superoperator F: 

\\p\\i>\\F( P )\\i. 

We employ the following theorem to bound the trace distance in terms of 
relative entropy. A classical analogue of the theorem can be found in || and 
has been used e.g. in p?]]. 

Theorem 1 For density matrices p, a of the same size: 

S(p\\a) > -?—\\p-a\\l. 

Proof: Since both the norm and the relative entropy are invariant under 
unitary transformations we assume that the basis of the density matrices 
diagonalizes p — a. Note that in general neither p nor a are diagonal now. 
Let S be the multiset of all nonnegative eigenvalues of p — a and R the 
multiset of all its negative eigenvalues. All eigenvalues are real since p — 
a is Hermitian. Now if the dimension of the space H$ spanned by the 
eigenvectors belonging to S has dimensions k and the space Hr spanned 
by the eigenvectors belonging to R has dimensions n — k, then increase 
the size of the underlying Hilbert space so that both spaces have the same 
dimension n' = max{/c,n — k}. The density matrices have zero entries at 
the corresponding positions. Now we view the density matrices as density 
matrices over a product space H2 <8> H n i , where the H2 space "indicates" the 
space Hs or Hr. 

We trace out the space H n i in p,a, p — a. The obtained 2x2 matrices are 
p,a, p — a. Note that the matrix p — a is diagonalized and contains the sum 
of all nonnegative eigenvalues, and the sum of all negative eigenvalues on its 
diagonal. Furthermore p — a = p — a. 
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Due to Lindblad-Uhlmann monotonicity of the relative von Neumann en- 
tropy we get S(p\\a) > S(p\\a). We will bound the latter by 

l/(21n2)||p- 5F||f = l/(21n2)||p~cr||?, 

and then conclude the theorem since the trace norm of p — a is the sum 
of absolute values of its eigenvalues which is the sum of absolute values of 
eigenvalues of p — a by construction, i.e., \\p — a\\i = \ \p — a\\i. 
So we have to prove the theorem only for 2x2 density matrices. Assume 
that the basis is chosen so that a is diagonal. Then 

P =(b* La) and ^=(o 1-c)- 

The relative von Neumann entropy S(p\\a) = —S(p) — trace [p log a]. The 
second term is — alogc — (1 — a) log(l — c). 

The first term is minus the entropy of the distribution induced by the eigen- 
values of p. So we compute the eigenvalues. 

The eigenvalues of p are the zeroes of its characteristic polynomial t 2 — t + 
a(l - a) - 66*. These are 1/2 ± y/1/4 - o(l - a) + 66*. Let x = 1/2 + 
v/l/4-a(l-a)+6&*. Then S{p) = H(x). 

The squared norm of p — a is the squared sum of absolute values of the 
eigenvalues of p — a. That matrix has the characteristic polynomial t 2 — 
(— a(l — a) + a(l — c) + (1 — a)c — c(l — c) + 66*). Thus its eigenvalues are 
±^—0(1 — a) + a(l — c) + (1 — a)c — c(l — c) + 66*. The squared norm as 
squared sum of absolute values of the eigenvalues is 

4(a 2 + c 2 -2ac + bb*). 

First we consider the case a = c. To prove this case we have to show 
that H(a) - H{x) > 21og(e)66* = 21og(e)[(x - 1/2) 2 - 1/4 + o(l - a)} = 
21og(e)[(x 2 -x) - (a 2 -a)]. 

Considering the function H(y) / log(e) + 2y 2 — 2y, we find that it is nonnega- 
tive and monotone decreasing for y between 1/2 and 1. Thus the inequality 
holds, when 1/2 < a and a < x. The first condition can be assumed w.l.o.g., 
and the second condition follows from the fact that x > 1/2 is an eigenvalue 
and a > 1/2 is a diagonal element. 

Now we look at the case c > a. If c < a, we can use the same argument for 
1 — c and 1 — a instead. We want to show that 

f(c) = S(p\\a)/log(e)-±\\p-a\\ 2 >0. 

We know this is true for a = c, so we show that increasing c cannot decrease 
the difference. This holds since: 

f'( c ) = - a /c + (1 - a) /{I - c) - 2(2c - 2a) 
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= {1 - a) '- a{ y c) -A(c-a) 
c(l - c) 

> 4(c - a) - 4(c - o) > 0. 

This yields the theorem for the 2x2 case and thus in general by the previous 
considerations. □ 
Note that for a bipartite state pab the following holds: 

S(A : B) = S(p AB \\pA® Pb) > ttt—^Wpab - PA ® Pb\\\- 

2m 2 

Thus the measurable distance between the tensor product state and the 
"real" bipartite state can be bounded in terms of the information. We will 
call the value D{A : B) = \ \pab — PA <8> Pb\\i the informational distance. 
The next lemma collects a few properties of informational distance that 
follow easily from the previous discussion. 



Lemma 2 For all states pabc the following holds: 

1. D(A : B) = D(B : A). 

2. D(AB : C) > D(A : C). 

3. < D(A : B) < 2. 

4- D(A : B) > \\F(pab) — F(pa ® Pb)||i f or a ll completely positive and 
trace-preserving superoperators F . 

5. D(A : B) < ^2S{A : B). 

Note that lemma 2.5 implies one of the main ingredients of the round hier- 
archy discovered in [^] (the "average encoding theorem"). 
Consider some density matrix pab that is block diagonal (with classical pa) 
in the basis composed as the tensor product of the standard basis for A and 
some other basis for B. Then denote the density matrix obtained by 
fixing A to some classical value a and normalizing. Pr{a) is the probability 
of a. 

The next properties of informational distance will be used later. 

Lemma 3 1. Let pab be the density matrix of a state, where ps corre- 
sponds to the density function of a classical random variable B on |0) 
and |1) with Pr(B = 1) = Pr(B = 0) = 1/2. Let there be a mea- 
surement acting on the A system only and yielding a Boolean random 
variable X with Pr(X = B) > 1 — e and Pr(X ^ B) < e (while 
the same measurement applied to pa® Pb yields a distribution with 
Pr(X = B) = Pr(X ^ B) = 1/2). Then D(A : B) > 1 - 2e. 
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2. For all block diagonal pab, where pa corresponds to a classical distri- 
bution pa on the standard basis vectors for A, the following holds: 

D(A:B) = E a \\pf - PB \\ X . 

PROOF: For the first item observe that D(A : B) > D(X : B) > 1 - 2e. 
The second item is a consequence of D(A : B) = \ \pab — Pa® Pb\\i- d 

5 Rounds in Quantum Communication 

It is well known that for deterministic, probabilistic, (and even limited non- 
deterministic) communication complexity there are functions which can be 
computed much more efficiently in k rounds than in k — 1 rounds (see pi], 



[12 1, [24 1, [25], |l4j). In most of these results the pointer jumping function 



is considered. 

Definition 11 Let Va an d Vb be disjoint sets of n vertices each. 
Let F A = {f A \f A ■ V A - V B }, and F B = {f B \f B : V B - V A }. 

tt \ t i \ J f'A(v) ifve V A , 
/(«) = W«) = ( fB(v) ifvE V B . 

Define f^(v) = v and / (fe) (w) = /(/ (fe-1) («))- 

Then g k : F A xF B -> (Va U V b ) is defined by g k (fA, fs) = ff^fl where 
vi & Va is fix. The function fk : Fa x F b — > {0, 1} is the XOR of all bits in 
the binary code of the output of g k ■ 



Nisan and Wigderson proved in [|24j that has a randomized k round 
communication complexity of Q(n/k 2 — /clogn) if B starts communicating 
and a deterministic k round communication complexity of /clogn if Alice 
starts. The lower bound can also be improved to Q(n/k + k), see |lE(| . 
Nisan and Wigderson also describe a randomized protocol computing gj. 
with communication 0((n/k)\ogn + /clogn) in the situation, where Bob 
starts and k rounds are allowed. Ponzio et. al. show that the deterministic 
communication complexity of f k is 0(n) then, if k = 0(1) [P5[ . 
With techniques similar to the ones in this paper we can also show a lower 

bound of j — gfcT ^ 1°§ n f° r the randomized k round complexity of when 

B starts, which is better than the above lower bounds for small constant 
values of k. 

Interaction in quantum communication complexity has also been investi- 
gated by Nayak, Ta-Shma, and Zuckerman p2;| . For the pointer jumping 
function their results imply the following: 

Fact 5 Q B ' k (f k ) = Q^^/k 4 ). 



First we give a new upper bound. The next result combines ideas from 24] 



and |5|. 
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Theorem 2 R^ B (g k ) < 0(£ • (log (fc/2) n + log /c) + fclogn). 

Proof: First Bob guesses with public random bits (4/e) • (n/k) vertices. 
For each chosen vertex v Bob communicates the first log^ fc//2 ^ n + 3 log fc bits 
of /b{v). 

In round i the active player communicates the pointer value vt = /^"^(ui). 
If it's Alice's turn, then she checks, whether vt is in Bob's list of the first 
round. Then Alice knows log^ 2 -* n + 31ogfc bits of /b(^). Note that this 
happens with probability 1 — e during the first k/2 rounds. In the following 
assume that this happened in round i < k/2, otherwise the protocol errs. 
Beginning from the round i, when Alice gets to know the log^ 2 ) n + 3 log k 
bits of f{vi) the players communicate in round i + t for all possible values 
of f(vi +t ) the most significant log^ 2- *** n + 31ogfc bits. Since there are at 
most n/^log^/ 2- *) n • fc 3 ) such values 0(n/fc 2 ) bits communication suffices. 
In the last round v k +2 is found. Overall the communication is at most 

fclogn + 0((l/e) • (n/fc)(log( fc / 2 )n + 31ogfc)) + fc-0(n/fc 2 ). □ 

Corollary 1 If k > 21og*(n) then R k ' B {g k ) < 0((f + k) log fc). 

We can replace fc log n by fc log fc in the above expression, because that term 
dominates only if logfc = 0(logn). 

Next we are going to prove a lower bound on the quantum communica- 
tion complexity of the pointer jumping function f k , for the situation that 
fc rounds are allowed and Bob sends the first message. We will consider a 
quantity dt capturing the information the active player has in round t on 
vertex t + 1 of the path. This quantity will be the informational distance 
between the active player's qubits and vertex t + 1. Our goal will be to 
bound dt in terms of <i t _i plus a term related to the average information on 
pointers in the other player's input. This leads to a recursion imposing a 
lower bound on the communication complexity, since in the end the protocol 
must have reasonably large information to produce the output, and in the 
beginning the respective information is 0. 

The informational distance dt measures the distance between the state of, 
say, Alice's qubits together with the vertex t + 1 of the path, and the tensor 
product of the states of Alice's qubits and vertex t + 1. In the product 
state Alice has no information on the vertex, so if the two states are close, 
Alice's powers to say something about the vertex are very limited. We will 
use the triangle inequality to bound dt by the sum of three intermediate 
distances. In the first step we move from the state given by the protocol to 
a state in which the t-th vertex is replaced by a uniformly random vertex, 
independent of previous communications. The distance to such a state can 
be bounded in terms of dt— l, because that quantity puts a bound on Bob's 
ability to detect such a replacement. We use the local transition theorem 
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from |^2| to conceal Alice's ability to detect such a replacement. Once the 
t-th vertex is random, we can move in the next step to a state in which also 
vertex t + 1 is random. The cost of this step corresponds to the average 
information a player has on a random pointer in the other player's input. 
The last step is similar to the first and reverses the first one's effect, i.e., 
replaces the "randomized" t-th vertex by its real value again. We arrive at 
the desired product state. 

Theorem 3 Q fc ' B ' pnfe (/fc) > n/2 2 ° W - Hogra. 

Proof: Fix a quantum protocol, of the following form. The protocol com- 
putes fk with error |, k rounds, Bob starting. At any time in the protocol 
Alice has access to qubits containing her input, some "work" qubits and 
some of the qubits used in messages so far, the same holds for Bob. We 
assume that the players never change their inputs. Usually a protocol gets 
some classical /a and fs as inputs, but we will investigate what happens if 
the protocol is started on a superposition over all inputs, in which all inputs 
have the same amplitude, i.e., on 

E ^\fA)\f B ). 

Note that {TaI = \Tb\ = n n . The superposition over all inputs is measured 
after the protocol has finished, so that a uniformly random input and the 
result of the protocol on that input are produced. 

The density matrix of the global state of the protocol is PM At M B t F A F B - 
Here Fa,Fb are the qubits holding the inputs of Alice and Bob and M.A,t 
resp. Mbj are the other qubits in the possession of Alice and Bob before 
the communication of round t. The state of the latter two systems of qubits 
may be entangled. In the beginning these qubits are independent of the 
input. 

We also require that in round t the vertex vt = f^~ l \vi) is communicated 
by a classical message and stored by the receiving player. This increases 
the communication by an additive klogn term. We demand that before 
round t the i'th vertex of the path is measured (remember that we are in a 
super-position over /a,/b)- This vertex is stored in some qubits Vt- V\ has 
the fixed value v\. In general, before the beginning of round t, we have a 
mixture and in each pure state in the mixture the first t— 1 vertices are fixed 
and Vt is either FA(vt-i) or Fb(u*-i). We then measure Vt in the standard 
basis. The measurements do not affect the correctness of the protocol. 
Let us denote dt = D(MbjFb '■ -Fyi(Vf)) when t is odd, and d t = D{Maj,Fa '■ 
FB(Vt)) when t is even. In this definition, we assume that the register i^(^t) 
(or FsiVt)) has been measured, although this measurement is not part of 
the protocol. Note that for t > 1, Vt is uniformly random, so (for odd t) 
the distance dt is taken as the average over v, of the informational distance 
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between the state of M B jF B restricted to Vt being equal to v, and Fa(v), 
and similarly for even t. 

We assume that the communication complexity of the protocol is 5n and 
prove a lower bound 5 > 2~ 2 ° <fc) . The general strategy of the proof is 
induction over the rounds, to successively bound d\, d,2, ■ ■ ■ , djfc+l- 
Bob sends the first message. As Bob has seen no message yet, we have that 
I(M B ,iF B : F A (Vi)) = 0, and hence d 1 = 0. We show that 

Lemma 4 dt+i < k\[d~t + 

We see that dt+i < StS 1 / 2 * for all t > 0. After round k one player, say Alice, 
announces the result which is supposed to be the parity of Fg^Vk+i) and 
included in Ma^+i- On the one hand dk+i = D(MA,k+i '■ F B (Vk+i)) < 
3 k 5 1 / 2h . On the other hand, by Lemma 3.1 D(M A , k +i : ®F B (V k+1 )) > 
1 - | = |. Together, \ < 2,H l l 2 \ so 5 > 2- 2 ° W . 
We now turn to proving Lemma 

W.l.o.g. let Alice be active in round t + 1. Let Ma = Maj+i and M B = 
M Bt t+\. Before the t + 1 round Vt+i = Fa(Vj) is measured. The resulting 
state is a probabilistic ensemble over the possibilities to fix Vj., . . . , Vt+i 3 
which are then classically distributed. Alice's reduced state is block diagonal 
with respect to the possible values of the vertices V%, ■ ■ ■ , Vt+\. For any value 

v of V t+1 let p" Ma m b f a Fb = Pm + aMbFaF b denote the pure state with vertex 
Vt+\ fixed to v. We are interested in the value 



dt+i = D(M A F A : F B {V t+1 )) = E. 



Pm a f a f b (v) - Pm a f a ® Pf s («) 



where the distribution on vertices w (induced by the state of the system Fa, F b ) 
is uniform. Recall that in the definition of dt+i, F B (v) is assumed to be uni- 
formly random (i.e., measured). The above quantity measures how much 
Alice knows about the value F B gives to the current vertex Vt+x- 
We define 



def 

lv = 



Pm b f b ~ Pm b f b 



(1) 



I.e., 7u is the distance between the state of Bob (Pm b f b ) before he receives 
the message in round t + 1, and the state pm b f B i which is his state averaged 
over v = F A (Vt). We show below that these two are almost always close to 
each other (this reflects the fact that Bob does not know much about Fa). 
For the purposes of the proof, we also consider a run of the protocol on 
the uniform superposition over inputs, where the qubits V\, V2, ■ ■ ■ are not 
measured during the course of the protocol. Let p~m a m b f a f b be the state 
before the communication in round t + 1 in this run of the protocol. For 
any v G V B , we define: 

Pv = f Pm a f a f b (v) ~ Pm a f a ® Pf b (v) , ( 2 ) 
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where Fb(v) is assumed to have been measured. Note that Pf b (v) = Pf b (v) 
with this measurement; both are randomly distributed over Va- 
Let Pm a m b f a f b r (respectively, Pm aMb f a f b r) be a purification of Pm a m b f a f b 
(resp. P v m a m b f a f b )i where R is some additional space used to purify the 
random path Vi, . . . , Vt+i (resp. Vi, . . . ,Vt). 

We employ the following fact from |22) (the "local transition theorem"). 
The fact is a variation of the impossibility result for unconditionally secure 
quantum bit commitment due to Mayers [21| and Lo and Chau [20|. 

Fact 6 Let pi,p2 be two density matrices with support in a Hilbert space H, 
K a Hilbert space of dimension at least dimH , and \ <t>2), any purifica- 
tions of p\ resp. p2 in H <g> K . Then there is a purification \(f> 2 ) of p2 in 
H ®K, that is obtained by applying a unitary transformation I<S>U to \4>2), 
where U is acting on K and I is the identity operator on H . 
|(^>2) has the property 



|0l)(0l|-|^>(^| 111 < 2^/| 



Pi - P2\\l- 



Now, due the above fact there is a local unitary transformation U v acting 
only on FaMaR such that 



v def T T T j\ 

a M A M B F A F B R — UvPM A M B F A F B RU v , 

and p"m a m b f a f b r are c l° se to each other. Moreover, 
Lemma 5 For all vertices v G Vb, 

< 



Pll A F A ~ ° V M A F A 



Pm a F a F b (v) a M A F A F B {v) 



a M A F A F B (v) ~ °M A F A ® PF B (v) 



< 2^,(3) 

< Pv (4) 



We will also prove: 



Lemma 6 For the uniform distribution on vertices v (induced by the state 
of the system Fa, Fb ), 



~Qvlv < d t , and 



(5) 
(6) 



Thus, for all v: 

Pm a f a f b (v) - Pm a f a ® PF B (v) 
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< 



+ 
+ 



Pm a F a F b (v) - a M A F A F B (v) 

a M A F A F B {v) ~ ° V M A F A ® PF B (v) 
a M A F A ® PF fl («) - Pll A F A ® PF B («) 



< 4^/> + 

Finally, 
£»(M A F^ : F B (Y t+1 )) 

= E ^ Pm a F a F b (v) ~ PM A F A ® AF B (i;) 

< E^V^ + A,] 

< VE^ + E„A, 



From equation (Q) 
From equation (||). 



By Jensen's inequality 
By Lemma ^. 



This completes the proof of Lemma |j. 

We finish the proof of Theorem 3 by proving Lemmas [5] and ^. 

Proof of Lemma [|: 

For equation (pi) notice that 



PM A F A ~ a M A F A 



< 



< 



Pm a F a F b (v) a M A F A F B {v) 



PM A F A M B F B R ~ VMaFaMbFbR 



and by fact 6 this is at most 2^/7^. 
For equation (0), 



a M A F A F B (v) ~ a M A F A ® PF B (v) 



< 



°M A F A RF B (v) U M A F A R ® PF B («) 
PM A F A RF B (v) - PM A F A R (8) PF s (u) 



PM A F A F B (v) - PM A F A ® Pf b (» 



By unitarity 

(*) 

By definition (g). 



For (*) notice that i? holds the path Px, . . . , Vt+i, which is determined 
by MaFa- We can apply a unitary transformation that "erases" this, to 
give us the state Pm a f a - The lemma is proved. 
Proof of lemma ||: 
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For equation (|5|), we have 



V 



— E « Pm b f b f a (u) Pm b f b ® Pf a (u) 

= D(M B F B : F A (V t )) 

< D(M B , t F B : F A (V t )) = d t . 



i 



The last step follows from the fact that Bob sends the i'th message, and 
this only decreases the informational distance: D(M Bjt+ iF B : F A (Vt)) < 



Lemma 7 For all t, I{MaiFa '■ F B ) < 25n, irrespective of whether some 
registers have been measured or not. 

In the beginning Alice has no information about F B , i.e., I{Ma,\Fa ■ F B ) = 
0. Recall that at most 5n qubits are communicated in the protocol. Any 
qubit sent from Alice to Bob does not increase her information on Bob's 
input. Any local unitary transformation does not increase her information. 
Now assume Bob sends a qubit Q. Then I(MaQFa ■ F B ) = I(MaFa '■ 
F B )+I(Q : F B \M A F A ) < I(M A F A : F B )+2 due to the Araki-Lieb inequality 
(fact 2) So each qubit sent from Bob to Alice increases her information on 
his input by at most 2. We thus get I{Ma,iFa '■ F B ) < 25n at all times t. 
Since measurements only decrease mutual information, the bound also holds 
when certain registers are measured and others are not, during the course 
of the protocol. The lemma is proved. 

Now consider the situation that F B is distributed uniformly instead of being 
in the uniform superposition (in other words, when F B has been measured). 
Then ~E v I(MaFa ■ F B (v)) < 25 (where v is uniformly random), using that 
the F B {v) are mutually independent. Now, 



where FaMaM b F b are as in the protocol without measurements. Further, 




v 



E u PM A F A F B (v) - PMaFa ® PF B (v) 

B V D(M A F A : F B (v)), 



l 




< ^2B V I(M A F A : F B (v)) 

< Vis, 



by Lemma 2 and Jensen's 



inequality. 



□ 
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6 The Disjointness Problem 



We now investigate the bounded round complexity of the disjointness prob- 
lem. Here Alice and Bob each receive the incidence vector of a subset of 
a size n universe. They reject iff the sets are disjoint. It is known the 
Ql(DISJ n ) > (1 - H(e))n gj, §■ Furthermore Q(DISJ n ) = O(^Elogn) 
by an application of Grover search M, In this protocol G(y / n) rounds are 
used. By a simple reduction (see [G1J) we get the following result. 

Theorem 4 Q k * ub (DISJ n ) = tt{n l / k ) for k = 0(1). 

Proof: Suppose we are given a k round quantum protocol for the disjoint- 
ness problem having error 1/3 and using communication c. W.l.o.g. we can 
assume Bob starts the communication, because the problem is symmetrical. 
We reduce the pointer jumping function to disjointness. 
In a bipartite graph with 2n vertices and outdegree 1 there are at most 
n k possible paths of length k starting at vertex v\ . For each such path we 
use an element in our universe for the disjointness problem. Given the left 
resp. right side of a specific graph Alice and Bob construct an instance of 
DISJ n k . Alice checks for each possible path of length k from v\ whether the 
path is consistent with her input and whether the paths leads to a vertex 
Vk+2 with odd number (if the k + 1st vertex is on the left side). In this case 
she takes the corresponding element of the universe into her subset. Bob 
does the analogous with his input. Now, if the two subsets intersect, then 
the element in the intersection witnesses a length k + 1 path leading to a 
vertex with odd number. If the subsets do not intersect, then the length 
k + 1 path from v\ leads to a vertex with even number. 
So we obtain a k round protocol for in which Bob starts. The communi- 
cation is c = J7(n) for any constant k, the input length for the constructed 
instance of disjointness is N = n k and we get Q k ' pub (DISJN) = ^(N 1 ^) 
for k = O(l). □ 



Acknowledgement 

The author wishes to thank Ashwin Nayak and Amnon Ta-Shma for im- 
proving the presentation of theorem 3 and for helpful comments, as well as 
Pranab Sen for pointing out a mistake in an earlier version of the proof of 
theorem 1. 



References 



[1] D. Aharonov, A. Kitaev, N. Nisan. Quantum Circuits with Mixed 
States. Proc. 30th ACM Symp. on Theory of Computing, pp. 20-30, 



1998. Also: |quant-ph/9806029 



19 



C.H. Bennett, S.J. Wiesner. Communication via one- and 
two-particle operators on Einstein-Podolsky- Rosen states. 
Phys. Rev. Lett., vol.69, pp. 2881-2884, 1992. 

R.E. Blahut. Principles and Practice of Information Theory. Addi- 
son Wesley, 1987. 

H. Buhrman, R. Cleve, A. Wigderson. Quantum vs. classical com- 
munication and computation. Proc. 30th ACM Symp. on Theory 
of Computing, pp. 63-68, 1998. Also: |quant-ph/980204q . 



H. Buhrman, R. Cleve, R. de Wolf, C. Zalka. Reducing error proba- 
bility in quantum algorithms. Proc. 40th IEEE Symp. Foundations 
of Computer Science, 1999. Also: |cs.CC/9904019| . 

H. Buhrman and R. de Wolf. Communication complexity lower 
bounds by polynomials. In Proceedings of the 16th Annual IEEE 
Conference on Computational Complexity, 2001. 

N. Cerf, C. Adami. Quantum information theory of entanglement 
and measurement. Proc. of Physics and Computation PhysComp, 



pp.65-71, 1996. Also: |quant-ph/9605039 



R. Cleve, H. Buhrman. Substituting quantum entanglement for 
communication. Physical Review A, vol.56, pp. 1201-1204, 1997. 
Also: |quant-ph/9704026| . 



R. Cleve, W. van Dam, M. Nielsen, A. Tapp. Quantum Entangle- 
ment and the Communication Complexity of the Inner Product 
Function. |quant-ph/9708019| , 1997. 



P. Duris, Z. Galil, G. Schnitger. Lower Bounds on Communication 
Complexity. Information & Computation, vol.73, pp. 1-22, 1987. 

L.K. Grover. A fast quantum mechanical algorithm for database 
search. 28th ACM Symp. on Theory of Computing, pp. 212-219, 



1996. Also: |quant-ph/9605043 



B. Halstenberg, R. Reischuk. Different Modes of Communication. 
SI AM Journal Comput., vol.22, pp.913-934, 1993. 

B. Kalyanasundaram, G. Schnitger. The Probabilistic Communica- 
tion Complexity of Set Intersection. SI AM Journal Discrete Math., 
vol.5, pp.545-557, 1992. 

H. Klauck. Lower bounds for computation with limited nonde- 
terminism. 13th IEEE Conference on Computational Complexity, 
pp.141-153, 1998. 

H. Klauck. On quantum and probabilistic communication: Las Ve- 
gas and one-way protocols. 32nd ACM Symp. on Theory of Com- 
puting, pp.644-651, 2000. 



20 



[16] H. Klauck. Quantum Communication Complexity. Workshop on 
Boolean Functions and Applications at the 21th Int. Colloquium on 
Automata, Languages, and Programming, pp. 241-252, 2000. Also: 
tauant-ph/0005032 . 

[17] H. Klauck, A. Nayak, A. Ta-Shma, D. Zuckerman. Interaction in 
Quantum Communication and the Complexity of Set Disjointness. 
33rd ACM Symp. on Theory of Computing, 2001. 

[18] I. Kremer. Quantum Communication. Master's thesis (Hebrew 
University), 1995. 

[19] E. Kushilevitz, N. Nisan. Communication Complexity. Cambridge 
Univ. Press, 1997. 

[20] H. Lo, H. Chau. Why quantum bit commitment and ideal quantum 
coin tossing are impossible. Physica D, vol.120, pp. 177-187, 1998. 

[21] D. Mayers. Unconditionally secure quantum bit commitment is im- 
possible. Phys. Rev. Lett, vol.78, pp.3414-3417, 1997. 

[22] A. Nayak, A. Ta-Shma, D. Zuckerman. Interaction in Quantum 
Communication Complexity. |quant-ph /0005 1 06| , 2000. 

[23] M.A. Nielsen, I.L. Chuang. Quantum Computation and Quantum 
Information. Cambridge University Press, 2000. 

[24] N. Nisan, A. Wigderson. Rounds in communication complexity re- 
visited. SI AM Journal Comput., vol.22, pp. 211-219, 1993. 

[25] S.J. Ponzio, J. Radhakrishnan, S. Venkatesh. The communication 
complexity of pointer chasing: applications of entropy and sam- 
pling Proc. 31th ACM Symp. on Theory of Computing, pp. 602-611, 
1999. 

[26] J. Preskill. Lecture notes on quantum information and quantum 
computation. Web address: 

www . theory . caltech.edu/people/preskill/ph229. 

[27] R. Raz. A Parallel Repetition Theorem. SI AM Journal Comput., 
vol.27, pp.763-803, 1998. 

[28] R. Raz. Exponential Separation of Quantum and Classical Commu- 
nication Complexity. Proc. 31th ACM Symp. on Theory of Com- 
puting, pp. 358-367, 1999. 

[29] A. Ta-Shma. Classical versus Quantum Communication Complex- 
ity. SIGACT News, vol.30(3), pp.25-34, 1999. 

[30] A.C. Yao. Quantum Circuit Complexity. 34th IEEE Symp. Foun- 
dations of Computer Science, pp. 352-361, 1993. 



21 



